Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 4410 |
| Missing cells | 28 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 2912 |
| Duplicate rows (%) | 66.0% |
| Total size in memory | 689.2 KiB |
| Average record size in memory | 160.0 B |
Variable types
| NUM | 10 |
|---|---|
| CAT | 9 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-07-25 02:14:49.681573 |
|---|---|
| Analysis finished | 2020-07-25 02:15:28.705953 |
| Duration | 39.02 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
| Dataset has 2912 (66.0%) duplicate rows | Duplicates |
NumCompaniesWorked has 586 (13.3%) zeros | Zeros |
TrainingTimesLastYear has 162 (3.7%) zeros | Zeros |
YearsAtCompany has 132 (3.0%) zeros | Zeros |
YearsSinceLastPromotion has 1743 (39.5%) zeros | Zeros |
YearsWithCurrManager has 789 (17.9%) zeros | Zeros |
Age
Real number (ℝ≥0)
| Distinct count | 43 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.923809523809524 |
|---|---|
| Minimum | 18 |
| Maximum | 60 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 30 |
| median | 36 |
| Q3 | 43 |
| 95-th percentile | 54 |
| Maximum | 60 |
| Range | 42 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.133301271 |
|---|---|
| Coefficient of variation (CV) | 0.2473553349 |
| Kurtosis | -0.4059505398 |
| Mean | 36.92380952 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.4130049527 |
| Sum | 162834 |
| Variance | 83.41719211 |
| Value | Count | Frequency (%) | |
| 35 | 234 | 5.3% | |
| 34 | 231 | 5.2% | |
| 36 | 207 | 4.7% | |
| 31 | 207 | 4.7% | |
| 29 | 204 | 4.6% | |
| 32 | 183 | 4.1% | |
| 30 | 180 | 4.1% | |
| 38 | 174 | 3.9% | |
| 33 | 174 | 3.9% | |
| 40 | 171 | 3.9% | |
| Other values (33) | 2445 | 55.4% |
| Value | Count | Frequency (%) | |
| 18 | 24 | 0.5% | |
| 19 | 27 | 0.6% | |
| 20 | 33 | 0.7% | |
| 21 | 39 | 0.9% | |
| 22 | 48 | 1.1% |
| Value | Count | Frequency (%) | |
| 60 | 15 | 0.3% | |
| 59 | 30 | 0.7% | |
| 58 | 42 | 1.0% | |
| 57 | 12 | 0.3% | |
| 56 | 42 | 1.0% |
Attrition
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| 0 | |
|---|---|
| 1 | 711 |
| Value | Count | Frequency (%) | |
| 0 | 3699 | 83.9% | |
| 1 | 711 | 16.1% |
BusinessTravel
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Travel_Rarely | |
|---|---|
| Travel_Frequently | |
| Non-Travel | 450 |
| Value | Count | Frequency (%) | |
| Travel_Rarely | 3129 | 71.0% | |
| Travel_Frequently | 831 | 18.8% | |
| Non-Travel | 450 | 10.2% |
Length
| Max length | 17 |
|---|---|
| Median length | 13 |
| Mean length | 13.44761905 |
| Min length | 10 |
Department
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Research & Development | |
|---|---|
| Sales | |
| Human Resources | 189 |
| Value | Count | Frequency (%) | |
| Research & Development | 2883 | 65.4% | |
| Sales | 1338 | 30.3% | |
| Human Resources | 189 | 4.3% |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 16.54217687 |
| Min length | 5 |
DistanceFromHome
Real number (ℝ≥0)
| Distinct count | 29 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.19251700680272 |
|---|---|
| Minimum | 1 |
| Maximum | 29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 14 |
| 95-th percentile | 26 |
| Maximum | 29 |
| Range | 28 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 8.105025519 |
|---|---|
| Coefficient of variation (CV) | 0.8816981805 |
| Kurtosis | -0.2270453549 |
| Mean | 9.192517007 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.9574657464 |
| Sum | 40539 |
| Variance | 65.69143866 |
| Value | Count | Frequency (%) | |
| 2 | 633 | 14.4% | |
| 1 | 624 | 14.1% | |
| 10 | 258 | 5.9% | |
| 9 | 255 | 5.8% | |
| 7 | 252 | 5.7% | |
| 3 | 252 | 5.7% | |
| 8 | 240 | 5.4% | |
| 5 | 195 | 4.4% | |
| 4 | 192 | 4.4% | |
| 6 | 177 | 4.0% | |
| Other values (19) | 1332 | 30.2% |
| Value | Count | Frequency (%) | |
| 1 | 624 | 14.1% | |
| 2 | 633 | 14.4% | |
| 3 | 252 | 5.7% | |
| 4 | 192 | 4.4% | |
| 5 | 195 | 4.4% |
| Value | Count | Frequency (%) | |
| 29 | 81 | 1.8% | |
| 28 | 69 | 1.6% | |
| 27 | 36 | 0.8% | |
| 26 | 75 | 1.7% | |
| 25 | 75 | 1.7% |
Education
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Bachelor | |
|---|---|
| Master | |
| College | |
| Below College | |
| Doctor | 144 |
| Value | Count | Frequency (%) | |
| Bachelor | 1716 | 38.9% | |
| Master | 1194 | 27.1% | |
| College | 846 | 19.2% | |
| Below College | 510 | 11.6% | |
| Doctor | 144 | 3.3% |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.779591837 |
| Min length | 6 |
EducationField
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Life Sciences | |
|---|---|
| Medical | |
| Marketing | |
| Technical Degree | |
| Other | 246 |
| Value | Count | Frequency (%) | |
| Life Sciences | 1818 | 41.2% | |
| Medical | 1392 | 31.6% | |
| Marketing | 477 | 10.8% | |
| Technical Degree | 396 | 9.0% | |
| Other | 246 | 5.6% | |
| Human Resources | 81 | 1.8% |
Length
| Max length | 16 |
|---|---|
| Median length | 13 |
| Mean length | 10.53333333 |
| Min length | 5 |
Gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Male | |
|---|---|
| Female |
| Value | Count | Frequency (%) | |
| Male | 2646 | 60.0% | |
| Female | 1764 | 40.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.8 |
| Min length | 4 |
JobLevel
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Low | |
|---|---|
| Medium | |
| High | |
| Very High | 318 |
| Exemplary | 207 |
| Value | Count | Frequency (%) | |
| Low | 1629 | 36.9% | |
| Medium | 1602 | 36.3% | |
| High | 654 | 14.8% | |
| Very High | 318 | 7.2% | |
| Exemplary | 207 | 4.7% |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 4.952380952 |
| Min length | 3 |
JobRole
Categorical
| Distinct count | 9 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Sales Executive | |
|---|---|
| Research Scientist | |
| Laboratory Technician | |
| Manufacturing Director | |
| Healthcare Representative | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| Sales Executive | 978 | 22.2% | |
| Research Scientist | 876 | 19.9% | |
| Laboratory Technician | 777 | 17.6% | |
| Manufacturing Director | 435 | 9.9% | |
| Healthcare Representative | 393 | 8.9% | |
| Manager | 306 | 6.9% | |
| Sales Representative | 249 | 5.6% | |
| Research Director | 240 | 5.4% | |
| Human Resources | 156 | 3.5% |
Length
| Max length | 25 |
|---|---|
| Median length | 18 |
| Mean length | 18.0707483 |
| Min length | 7 |
MaritalStatus
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| Married | |
|---|---|
| Single | |
| Divorced |
| Value | Count | Frequency (%) | |
| Married | 2019 | 45.8% | |
| Single | 1410 | 32.0% | |
| Divorced | 981 | 22.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.902721088 |
| Min length | 6 |
MonthlyIncome
Real number (ℝ≥0)
| Distinct count | 1349 |
|---|---|
| Unique (%) | 30.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65029.31292517007 |
|---|---|
| Minimum | 10090 |
| Maximum | 199990 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 10090 |
|---|---|
| 5-th percentile | 20970 |
| Q1 | 29110 |
| median | 49190 |
| Q3 | 83800 |
| 95-th percentile | 178560 |
| Maximum | 199990 |
| Range | 189900 |
| Interquartile range (IQR) | 54690 |
Descriptive statistics
| Standard deviation | 47068.88856 |
|---|---|
| Coefficient of variation (CV) | 0.7238103317 |
| Kurtosis | 1.000231855 |
| Mean | 65029.31293 |
| Median Absolute Deviation (MAD) | 21990 |
| Skewness | 1.368884163 |
| Sum | 286779270 |
| Variance | 2215480270 |
| Value | Count | Frequency (%) | |
| 23420 | 12 | 0.3% | |
| 61420 | 9 | 0.2% | |
| 27410 | 9 | 0.2% | |
| 24040 | 9 | 0.2% | |
| 26100 | 9 | 0.2% | |
| 23800 | 9 | 0.2% | |
| 55620 | 9 | 0.2% | |
| 34520 | 9 | 0.2% | |
| 63470 | 9 | 0.2% | |
| 25590 | 9 | 0.2% | |
| Other values (1339) | 4317 | 97.9% |
| Value | Count | Frequency (%) | |
| 10090 | 3 | 0.1% | |
| 10510 | 3 | 0.1% | |
| 10520 | 3 | 0.1% | |
| 10810 | 3 | 0.1% | |
| 10910 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 199990 | 3 | 0.1% | |
| 199730 | 3 | 0.1% | |
| 199430 | 3 | 0.1% | |
| 199260 | 3 | 0.1% | |
| 198590 | 3 | 0.1% |
| Distinct count | 10 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 19 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6948303347756775 |
|---|---|
| Minimum | 0.0 |
| Maximum | 9.0 |
| Zeros | 586 |
| Zeros (%) | 13.3% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.498886889 |
|---|---|
| Coefficient of variation (CV) | 0.9272891345 |
| Kurtosis | 0.007287480878 |
| Mean | 2.694830335 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.026766676 |
| Sum | 11833 |
| Variance | 6.244435683 |
| Value | Count | Frequency (%) | |
| 1 | 1558 | 35.3% | |
| 0 | 586 | 13.3% | |
| 3 | 474 | 10.7% | |
| 2 | 438 | 9.9% | |
| 4 | 415 | 9.4% | |
| 7 | 222 | 5.0% | |
| 6 | 208 | 4.7% | |
| 5 | 187 | 4.2% | |
| 9 | 156 | 3.5% | |
| 8 | 147 | 3.3% | |
| (Missing) | 19 | 0.4% |
| Value | Count | Frequency (%) | |
| 0 | 586 | 13.3% | |
| 1 | 1558 | 35.3% | |
| 2 | 438 | 9.9% | |
| 3 | 474 | 10.7% | |
| 4 | 415 | 9.4% |
| Value | Count | Frequency (%) | |
| 9 | 156 | 3.5% | |
| 8 | 147 | 3.3% | |
| 7 | 222 | 5.0% | |
| 6 | 208 | 4.7% | |
| 5 | 187 | 4.2% |
PercentSalaryHike
Real number (ℝ≥0)
| Distinct count | 15 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.209523809523809 |
|---|---|
| Minimum | 11 |
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 12 |
| median | 14 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 25 |
| Range | 14 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.659107516 |
|---|---|
| Coefficient of variation (CV) | 0.2405800183 |
| Kurtosis | -0.3026383931 |
| Mean | 15.20952381 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.8205689838 |
| Sum | 67074 |
| Variance | 13.38906782 |
| Value | Count | Frequency (%) | |
| 11 | 630 | 14.3% | |
| 13 | 627 | 14.2% | |
| 14 | 603 | 13.7% | |
| 12 | 594 | 13.5% | |
| 15 | 303 | 6.9% | |
| 18 | 267 | 6.1% | |
| 17 | 246 | 5.6% | |
| 16 | 234 | 5.3% | |
| 19 | 228 | 5.2% | |
| 22 | 168 | 3.8% | |
| Other values (5) | 510 | 11.6% |
| Value | Count | Frequency (%) | |
| 11 | 630 | 14.3% | |
| 12 | 594 | 13.5% | |
| 13 | 627 | 14.2% | |
| 14 | 603 | 13.7% | |
| 15 | 303 | 6.9% |
| Value | Count | Frequency (%) | |
| 25 | 54 | 1.2% | |
| 24 | 63 | 1.4% | |
| 23 | 84 | 1.9% | |
| 22 | 168 | 3.8% | |
| 21 | 144 | 3.3% |
StockOptionLevel
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | |
| 3 | 255 |
| Value | Count | Frequency (%) | |
| 0 | 1893 | 42.9% | |
| 1 | 1788 | 40.5% | |
| 2 | 474 | 10.7% | |
| 3 | 255 | 5.8% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
TotalWorkingYears
Real number (ℝ≥0)
| Distinct count | 40 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 9 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.279936378095888 |
|---|---|
| Minimum | 0.0 |
| Maximum | 40.0 |
| Zeros | 33 |
| Zeros (%) | 0.7% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 10 |
| Q3 | 15 |
| 95-th percentile | 28 |
| Maximum | 40 |
| Range | 40 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 7.782222141 |
|---|---|
| Coefficient of variation (CV) | 0.6899172017 |
| Kurtosis | 0.9129359961 |
| Mean | 11.27993638 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.116831796 |
| Sum | 49643 |
| Variance | 60.56298145 |
| Value | Count | Frequency (%) | |
| 10 | 605 | 13.7% | |
| 6 | 375 | 8.5% | |
| 8 | 307 | 7.0% | |
| 9 | 287 | 6.5% | |
| 5 | 264 | 6.0% | |
| 7 | 243 | 5.5% | |
| 1 | 242 | 5.5% | |
| 4 | 189 | 4.3% | |
| 12 | 144 | 3.3% | |
| 3 | 126 | 2.9% | |
| Other values (30) | 1619 | 36.7% |
| Value | Count | Frequency (%) | |
| 0 | 33 | 0.7% | |
| 1 | 242 | 5.5% | |
| 2 | 93 | 2.1% | |
| 3 | 126 | 2.9% | |
| 4 | 189 | 4.3% |
| Value | Count | Frequency (%) | |
| 40 | 6 | 0.1% | |
| 38 | 3 | 0.1% | |
| 37 | 12 | 0.3% | |
| 36 | 18 | 0.4% | |
| 35 | 9 | 0.2% |
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7993197278911564 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 162 |
| Zeros (%) | 3.7% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.28897817 |
|---|---|
| Coefficient of variation (CV) | 0.4604612174 |
| Kurtosis | 0.4911489985 |
| Mean | 2.799319728 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.5527476257 |
| Sum | 12345 |
| Variance | 1.661464722 |
| Value | Count | Frequency (%) | |
| 2 | 1641 | 37.2% | |
| 3 | 1473 | 33.4% | |
| 4 | 369 | 8.4% | |
| 5 | 357 | 8.1% | |
| 1 | 213 | 4.8% | |
| 6 | 195 | 4.4% | |
| 0 | 162 | 3.7% |
| Value | Count | Frequency (%) | |
| 0 | 162 | 3.7% | |
| 1 | 213 | 4.8% | |
| 2 | 1641 | 37.2% | |
| 3 | 1473 | 33.4% | |
| 4 | 369 | 8.4% |
| Value | Count | Frequency (%) | |
| 6 | 195 | 4.4% | |
| 5 | 357 | 8.1% | |
| 4 | 369 | 8.4% | |
| 3 | 1473 | 33.4% | |
| 2 | 1641 | 37.2% |
| Distinct count | 37 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0081632653061225 |
|---|---|
| Minimum | 0 |
| Maximum | 40 |
| Zeros | 132 |
| Zeros (%) | 3.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 9 |
| 95-th percentile | 20 |
| Maximum | 40 |
| Range | 40 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 6.125135445 |
|---|---|
| Coefficient of variation (CV) | 0.8740001072 |
| Kurtosis | 3.923864205 |
| Mean | 7.008163265 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.763328232 |
| Sum | 30906 |
| Variance | 37.51728422 |
| Value | Count | Frequency (%) | |
| 5 | 588 | 13.3% | |
| 1 | 513 | 11.6% | |
| 3 | 384 | 8.7% | |
| 2 | 381 | 8.6% | |
| 10 | 360 | 8.2% | |
| 4 | 330 | 7.5% | |
| 7 | 270 | 6.1% | |
| 9 | 246 | 5.6% | |
| 8 | 240 | 5.4% | |
| 6 | 228 | 5.2% | |
| Other values (27) | 870 | 19.7% |
| Value | Count | Frequency (%) | |
| 0 | 132 | 3.0% | |
| 1 | 513 | 11.6% | |
| 2 | 381 | 8.6% | |
| 3 | 384 | 8.7% | |
| 4 | 330 | 7.5% |
| Value | Count | Frequency (%) | |
| 40 | 3 | 0.1% | |
| 37 | 3 | 0.1% | |
| 36 | 6 | 0.1% | |
| 34 | 3 | 0.1% | |
| 33 | 15 | 0.3% |
| Distinct count | 16 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1877551020408164 |
|---|---|
| Minimum | 0 |
| Maximum | 15 |
| Zeros | 1743 |
| Zeros (%) | 39.5% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 9 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.221699321 |
|---|---|
| Coefficient of variation (CV) | 1.4726051 |
| Kurtosis | 3.601760518 |
| Mean | 2.187755102 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.982939156 |
| Sum | 9648 |
| Variance | 10.37934651 |
| Value | Count | Frequency (%) | |
| 0 | 1743 | 39.5% | |
| 1 | 1071 | 24.3% | |
| 2 | 477 | 10.8% | |
| 7 | 228 | 5.2% | |
| 4 | 183 | 4.1% | |
| 3 | 156 | 3.5% | |
| 5 | 135 | 3.1% | |
| 6 | 96 | 2.2% | |
| 11 | 72 | 1.6% | |
| 8 | 54 | 1.2% | |
| Other values (6) | 195 | 4.4% |
| Value | Count | Frequency (%) | |
| 0 | 1743 | 39.5% | |
| 1 | 1071 | 24.3% | |
| 2 | 477 | 10.8% | |
| 3 | 156 | 3.5% | |
| 4 | 183 | 4.1% |
| Value | Count | Frequency (%) | |
| 15 | 39 | 0.9% | |
| 14 | 27 | 0.6% | |
| 13 | 30 | 0.7% | |
| 12 | 30 | 0.7% | |
| 11 | 72 | 1.6% |
| Distinct count | 18 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.12312925170068 |
|---|---|
| Minimum | 0 |
| Maximum | 17 |
| Zeros | 789 |
| Zeros (%) | 17.9% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.567326744 |
|---|---|
| Coefficient of variation (CV) | 0.8651988638 |
| Kurtosis | 0.1679485428 |
| Mean | 4.123129252 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.8328836111 |
| Sum | 18183 |
| Variance | 12.7258201 |
| Value | Count | Frequency (%) | |
| 2 | 1032 | 23.4% | |
| 0 | 789 | 17.9% | |
| 7 | 648 | 14.7% | |
| 3 | 426 | 9.7% | |
| 8 | 321 | 7.3% | |
| 4 | 294 | 6.7% | |
| 1 | 228 | 5.2% | |
| 9 | 192 | 4.4% | |
| 5 | 93 | 2.1% | |
| 6 | 87 | 2.0% | |
| Other values (8) | 300 | 6.8% |
| Value | Count | Frequency (%) | |
| 0 | 789 | 17.9% | |
| 1 | 228 | 5.2% | |
| 2 | 1032 | 23.4% | |
| 3 | 426 | 9.7% | |
| 4 | 294 | 6.7% |
| Value | Count | Frequency (%) | |
| 17 | 21 | 0.5% | |
| 16 | 6 | 0.1% | |
| 15 | 15 | 0.3% | |
| 14 | 15 | 0.3% | |
| 13 | 42 | 1.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Age | Attrition | BusinessTravel | Department | DistanceFromHome | Education | EducationField | Gender | JobLevel | JobRole | MaritalStatus | MonthlyIncome | NumCompaniesWorked | PercentSalaryHike | StockOptionLevel | TotalWorkingYears | TrainingTimesLastYear | YearsAtCompany | YearsSinceLastPromotion | YearsWithCurrManager | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 51 | 0 | Travel_Rarely | Sales | 6 | College | Life Sciences | Female | Low | Healthcare Representative | Married | 131160 | 1.0 | 11 | 0 | 1.0 | 6 | 1 | 0 | 0 |
| 1 | 31 | 1 | Travel_Frequently | Research & Development | 10 | Below College | Life Sciences | Female | Low | Research Scientist | Single | 41890 | 0.0 | 23 | 1 | 6.0 | 3 | 5 | 1 | 4 |
| 2 | 32 | 0 | Travel_Frequently | Research & Development | 17 | Master | Other | Male | Very High | Sales Executive | Married | 193280 | 1.0 | 15 | 3 | 5.0 | 2 | 5 | 0 | 3 |
| 3 | 38 | 0 | Non-Travel | Research & Development | 2 | Doctor | Life Sciences | Male | High | Human Resources | Married | 83210 | 3.0 | 11 | 3 | 13.0 | 5 | 8 | 7 | 5 |
| 4 | 32 | 0 | Travel_Rarely | Research & Development | 10 | Below College | Medical | Male | Low | Sales Executive | Single | 23420 | 4.0 | 12 | 2 | 9.0 | 2 | 6 | 0 | 4 |
| 5 | 46 | 0 | Travel_Rarely | Research & Development | 8 | Bachelor | Life Sciences | Female | Very High | Research Director | Married | 40710 | 3.0 | 13 | 0 | 28.0 | 5 | 7 | 7 | 7 |
| 6 | 28 | 1 | Travel_Rarely | Research & Development | 11 | College | Medical | Male | Medium | Sales Executive | Single | 58130 | 2.0 | 20 | 1 | 5.0 | 2 | 0 | 0 | 0 |
| 7 | 29 | 0 | Travel_Rarely | Research & Development | 18 | Bachelor | Life Sciences | Male | Medium | Sales Executive | Married | 31430 | 2.0 | 22 | 3 | 10.0 | 2 | 0 | 0 | 0 |
| 8 | 31 | 0 | Travel_Rarely | Research & Development | 1 | Bachelor | Life Sciences | Male | High | Laboratory Technician | Married | 20440 | 0.0 | 21 | 0 | 10.0 | 2 | 9 | 7 | 8 |
| 9 | 25 | 0 | Non-Travel | Research & Development | 7 | Master | Medical | Female | Very High | Laboratory Technician | Divorced | 134640 | 1.0 | 13 | 1 | 6.0 | 2 | 6 | 1 | 5 |
Last rows
| Age | Attrition | BusinessTravel | Department | DistanceFromHome | Education | EducationField | Gender | JobLevel | JobRole | MaritalStatus | MonthlyIncome | NumCompaniesWorked | PercentSalaryHike | StockOptionLevel | TotalWorkingYears | TrainingTimesLastYear | YearsAtCompany | YearsSinceLastPromotion | YearsWithCurrManager | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4400 | 37 | 0 | Travel_Rarely | Research & Development | 22 | Doctor | Medical | Female | Medium | Manufacturing Director | Married | 30550 | 2.0 | 14 | 3 | 17.0 | 3 | 3 | 0 | 2 |
| 4401 | 45 | 0 | Travel_Frequently | Sales | 21 | Below College | Marketing | Male | High | Research Scientist | Married | 22890 | 4.0 | 13 | 0 | 9.0 | 3 | 3 | 0 | 2 |
| 4402 | 37 | 1 | Travel_Frequently | Sales | 2 | Bachelor | Marketing | Male | Low | Laboratory Technician | Divorced | 40010 | 6.0 | 11 | 1 | 17.0 | 2 | 1 | 0 | 0 |
| 4403 | 39 | 0 | Travel_Frequently | Research & Development | 22 | Bachelor | Medical | Female | Low | Manufacturing Director | Single | 129650 | 0.0 | 19 | 1 | 20.0 | 2 | 19 | 11 | 8 |
| 4404 | 29 | 0 | Travel_Rarely | Sales | 4 | Bachelor | Other | Female | Medium | Human Resources | Single | 35390 | 1.0 | 18 | 0 | 6.0 | 2 | 6 | 1 | 5 |
| 4405 | 42 | 0 | Travel_Rarely | Research & Development | 5 | Master | Medical | Female | Low | Research Scientist | Single | 60290 | 3.0 | 17 | 1 | 10.0 | 5 | 3 | 0 | 2 |
| 4406 | 29 | 0 | Travel_Rarely | Research & Development | 2 | Master | Medical | Male | Low | Laboratory Technician | Divorced | 26790 | 2.0 | 15 | 0 | 10.0 | 2 | 3 | 0 | 2 |
| 4407 | 25 | 0 | Travel_Rarely | Research & Development | 25 | College | Life Sciences | Male | Medium | Sales Executive | Married | 37020 | 0.0 | 20 | 0 | 5.0 | 4 | 4 | 1 | 2 |
| 4408 | 42 | 0 | Travel_Rarely | Sales | 18 | College | Medical | Male | Low | Laboratory Technician | Divorced | 23980 | 0.0 | 14 | 1 | 10.0 | 2 | 9 | 7 | 8 |
| 4409 | 40 | 0 | Travel_Rarely | Research & Development | 28 | Bachelor | Medical | Male | Medium | Laboratory Technician | Divorced | 54680 | 0.0 | 12 | 0 | NaN | 6 | 21 | 3 | 9 |
Most frequent
| Age | Attrition | BusinessTravel | Department | DistanceFromHome | Education | EducationField | Gender | JobLevel | JobRole | MaritalStatus | MonthlyIncome | NumCompaniesWorked | PercentSalaryHike | StockOptionLevel | TotalWorkingYears | TrainingTimesLastYear | YearsAtCompany | YearsSinceLastPromotion | YearsWithCurrManager | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 18 | 0 | Non-Travel | Research & Development | 1 | Master | Medical | Male | Medium | Sales Executive | Single | 27200 | 1.0 | 22 | 1 | 0.0 | 2 | 0 | 0 | 0 | 3 |
| 1 | 18 | 0 | Non-Travel | Research & Development | 2 | Bachelor | Life Sciences | Male | High | Sales Representative | Single | 186060 | 1.0 | 24 | 2 | 0.0 | 4 | 0 | 0 | 0 | 3 |
| 2 | 18 | 0 | Non-Travel | Sales | 5 | Master | Other | Male | Medium | Manager | Single | 32300 | 1.0 | 12 | 1 | 0.0 | 3 | 0 | 0 | 0 | 3 |
| 3 | 18 | 0 | Travel_Rarely | Sales | 7 | Bachelor | Life Sciences | Male | Low | Research Scientist | Single | 38120 | 1.0 | 15 | 0 | 0.0 | 3 | 0 | 0 | 0 | 3 |
| 4 | 18 | 1 | Non-Travel | Research & Development | 2 | Master | Medical | Male | High | Laboratory Technician | Single | 109650 | 1.0 | 18 | 0 | 0.0 | 5 | 0 | 0 | 0 | 3 |
| 5 | 18 | 1 | Travel_Frequently | Research & Development | 2 | Bachelor | Technical Degree | Male | Low | Sales Executive | Single | 34680 | 1.0 | 18 | 2 | 0.0 | 4 | 0 | 0 | 0 | 3 |
| 7 | 18 | 1 | Travel_Rarely | Research & Development | 1 | Master | Life Sciences | Male | Low | Sales Executive | Single | 23350 | 1.0 | 14 | 2 | 0.0 | 3 | 0 | 0 | 0 | 3 |
| 8 | 19 | 0 | Travel_Rarely | Research & Development | 1 | Bachelor | Other | Female | Exemplary | Manufacturing Director | Single | 152020 | 1.0 | 18 | 3 | 1.0 | 2 | 1 | 0 | 0 | 3 |
| 9 | 19 | 0 | Travel_Rarely | Research & Development | 23 | Master | Life Sciences | Male | Medium | Laboratory Technician | Single | 191970 | 1.0 | 12 | 0 | 1.0 | 2 | 1 | 0 | 0 | 3 |
| 10 | 19 | 0 | Travel_Rarely | Sales | 2 | Master | Marketing | Male | High | Laboratory Technician | Single | 115570 | 1.0 | 22 | 0 | 1.0 | 0 | 1 | 0 | 1 | 3 |